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Abstract 

We review the basic hypotheses which motivate the statistical framework used 
to analyze the cosmic microwave background, and how that framework can be en- 
larged as we relax those hypotheses. In particular, we try to separate as much as 
possible the questions of gaussianity, homogeneity and isotropy from each other. 
We focus both on isotropic estimators of non-gaussianity as well as statistically 
anisotropic estimators of gaussianity, giving particular emphasis on their signa- 
tures and the enhanced "cosmic variances" that become increasingly important 
as our putative Universe becomes less symmetric. After reviewing the formalism 
behind some simple model-independent tests, we discuss how these tests can be 
applied to CMB data when searching for large scale "anomalies". 
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1 Introduction 



According to our current understanding of the Universe, the morphology of the cosmic 
microwave background (CMB) temperature field, as well as all cosmological structures 
that are now visible, like galaxies, clusters of galaxies and the whole web of large-scale 
structure, are probably the descendants of quantum process that took place some 10~ 35 
seconds after the Big Bang. In the standard lore, the machinery responsible for these 
processes is termed cosmic inflation and, in general terms, what it means is that micro- 
scopic quantum fluctuations pervading the primordial Universe are stretched to what 
correspond, today, to cosmological scales (see (U El 13] for comprehensive introductions 
to inflation.) These primordial perturbations serve as initial conditions for the process 
of structure formation, which enhance these initial perturbations through gravitational 
instability. The subsequent (classical) evolution of these instabilities preserves the main 
statistical features of these fluctuations that were inherited from their inflationary origin 
- provided, of course, that we restrain ourselves to linear perturbation theory. 

However, given that matter has a natural tendency to cluster, and this inevitably 
leads to non-linearities (not to mention the sorts of complications that come with bary- 
onic physics), the structures which are visible today are far from ideal probes of those 
statistical properties. CMB photons, on the other hand, to an excellent approximation 
experience free streaming since the time of decoupling (z ~ 1100), and are therefore 
exempt from these non-linearities (except, of course, for secondary anisotropies such 
as the Rees-Sciama effect or the Sunyaev-Zel'dovich effect), which implies that they 
constitute an ideal window to the physics of the early Universe - see, e.g., [U El [6]. In 
fact, we can determine the primary CMB anisotropies as well as most of the secondary 
anisotropies on large scales, such as the Integrated Sachs-Wolfe effect, completely in 
terms of the initial conditions by means of a linear kernel: 



where rf is conformal time, and S l denote the initial conditions of all matter and metric 
fields (as well as their time derivatives, if the initial conditions are non-adiabatic.) Here 
Ki is a linear kernel, or a retarded Green's function, that propagates the radiation field 
to the time and place of its detection, here on Earth. Since that kernel is insensitive 
to the statistical nature of the initial conditions (which can be thought of as constants 
which multiply the source terms), those properties are precisely transferred to the CMB 
temperature field 0. 

The statistical properties of the primordial fluctuations are, to lowest order in per- 
turbation theory, quite simple: because the quantum fluctuations that get stretched 
and enhanced by inflation are basically harmonic oscillators in their ground state, the 
distribution of those fluctuations is Gaussian, with each mode an independent random 
variable. The Fourier modes of these fluctuations are characterized by random phases 
(corresponding to the random initial values of the oscillators), with zero mean, and vari- 
ances which are given simply by the field mass and the mode's wavenumber k = 27r/A. 




AT(n;rj ) 
T(Vo) 
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The presence of higher-order interactions (which exist even for free fields, because of 
gravity) changes this simple picture, introducing higher-order correlations which de- 
stroy gaussianity - even in the simplest scenario of inflation [7J [HI [9]. However, since 
these interactions are typically suppressed by powers of the factor GH 2 ~ 1CT 12 , where 
G is Newton's constant and H the Hubble parameter during inflation, the corrections 
are small - but, at least in principle, detectable [TU1 [TT1 [T2] . 

Since these statistical properties are a generic prediction of (essentially) all infla- 
tionary models, they can also be inferred from two ingredients that are usually assumed 
as a first approximation to our Universe. First, since inflation was designed to stretch 
our Universe until it became spatially homogeneous and isotropic, it is reasonable to 
expect that all statistical momenta of the CMB should be spatially homogeneous and 
rotationally invariant, regardless of their general form. Second, in linear perturbation 
theory [1 3j where we have a large number of cosmo logical fluctuations evolving inde- 
pendently, we can expect, based on the central limit theorem, that the Universe will 
obey a Gaussian distribution. 

The power of this program lies, therefore, in its simplicity: if the Universe is indeed 
Gaussian, homogeneous and statistically isotropic (SI), then essentially all the informa- 
tion about inflation and the linear (low redshift) evolution of the Universe is encoded 
in the variance, or two-point correlation function, of large-scale cosmological structures 
and/or the CMB. As it turns out, the five year dataset from the Wilkinson Microwave 
anisotropy probe (WMAP) strongly supports these predictions [T4"[ ITT] . Moreover, the 
measurements of the CMB temperature power spectrum by the WMAP team, alongside 
measurements of the matter power spectrum from existing survey of galaxies [T5l [T6] 
and data from type la supernovae fTf \ [T8| [T9], have shown remarkable consistency with 
a concordance model (ACDM), in which the cosmos figures as a Gaussian, spatially 
flat, approximately homogeneous and statistically isotropic web of structures composed 
mainly of baryons, dark matter and dark energy. 

However, while the detection of a nearly scale-invariant and Gaussian spectrum is a 
powerful boost to the idea of inflation, just knowing the variance of the primordial fluc- 
tuations is not sufficient to single out which particular inflationary model was realized 
in our Universe. For that we will need not only the 2-point function, but the higher 
momenta of the distribution as well. Therefore, in order to break this model degeneracy 
we must go beyond the framework of the ACDM, Gaussian, spatially homogeneous and 
statistically isotropic Universe. 

Reconstructing our cosmic history, however, is not the only reason to explore further 
the statistical properties of the CMB. The full-sky temperature maps by WMAP [20. [TT] 
have revealed the existence of a series of large-angle anomalies - which, incidentally 
were (on hindsight) already visible in the lower-resolution COBE data |21| . These 
anomalies suggest that at least one of our cherished hypothesis underlying the standard 
cosmological model might be wrong - even as a first-order approximation. Perhaps 
the most intriguing anomalies (described in more detail in other review papers in this 
volume) are the low value of the quadrupole and its alignment of the quadrupole (£ = 2) 
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with the octupole (1 = 3) [2211251 1231 12511251 I27j. the sphericity [26] (or lack of planarity 
[28J), of the multipole i = 5, and the north-south asymmetry [291 l30| EH [321 [33J. In 
the framework of the standard cosmological model, these are very unlikely statistical 
events, and yet the evidence that they exist in the real data (and are not artifacts of 
poorly subtracted extended foregrounds - e.g., [34] ) is strong. 

Concerning theoretical explanations, even though we have by now an arsenal of ad- 
hoc models designed to account for the existence of these anomalies, none has yet quite 
succeeded in explaining their origin. Nevertheless, they all share the point of view that 
the detected anomalies might be related to a deviation of gaussianity and/or statistical 
isotropy. 

In this review we will describe, first, how to characterize, from the point of view of 
the underlying spacetime symmetries, both non-gaussianity and statistical anisotropy. 
We will adopt two guiding principles. The first is that gaussianity and SI, being com- 
pletely different properties of a random variable, should be treated separately, whenever 
possible or practical. Second, since there is only one type of gaussianity and SI but vir- 
tually infinite ways away from them, it is important to try to measure these deviations 
without a particular model or anomaly in mind - although we may eventually appeal 
to particular models as illustrations or as a means of comparison. This approach is not 
new and, although not usually mentioned explicitly, it has been adopted in a number 
of recent papers [331 13"6"] . 

One of the main motivations for this model-independent approach is the difficult is- 
sue of aprioristic statistics: one can only test the random nature of a process if it can be 
repeated a very large (formally, infinite) number of times. Since the CMB only changes 
on a timescale of tens of millions of years, waiting for our surface of last scattering 
to probe a different region of the Universe is not a practical proposition. Instead, we 
are stuck with one dataset (a sequence of apparently random numbers), which we can 
subject to any number of tests. Clearly, by sheer chance about 30% of the tests will give 
a positive detection with 70% confidence level (C.L.), 10% will give a positive detection 
with 90% C.L., and so on. With enough time, anyone can come up with detections of 
arbitrarily high significance - and ingenuity will surely accelerate this process. Hence, 
it would be useful to have a few guiding principles to inform and motivate our statistical 
tests, so that we don't end up shooting blindly at a finite number of fish in a small 
wheelbarrow. 



This review is divided in two parts. We start Part I by reviewing the basic statistical 
framework behind linear perturbation theory (^2]). This serves as a motivation for ^3j 
where we discuss the formal aspects of non-Gaussian and statistically isotropic models 
(S |3.1 ), as well as Gaussian models of statistical anisotropy ( §3.2 ). Part II is devoted to 
a discussion on model-independent cosmological tests of non-gaussianity and statistical 
anisotropy and their application to CMB data. We focus on two particular tests, 
namely, the multipole vectors statistics (Q and functional modifications of the two- 
point correlation function (Sj5|. After discussing how such tests are usually carried out 
when searching for anomalies in CMB data ( ^6.1 ), we present a new formalism which 
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generalizes the standard procedure by including the ergodicity of cosmological data as 
a possible source of errors (j |6.2 ). This formalism is illustrated in Sj7j where we carry a 
search of planar-type deviations of isotropy in CMB data. We then conclude in |8j 
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Part I 

The linearized Universe 



2 General structure 

We start by defining the temperature fluctuation field. Since the background radiation 
is known to have an average temperature of 2.725K, we are interested only in devia- 
tions from this value at a given direction n in the CMB sky. So let us consider the 
dimensionless function on S 2 : 



where To = 2.725 K is the blackbody temperature of the mean photon energy dis- 
tribution - which, if homogeneity holds, is also equal to the ensemble average of the 
temperature. 

In full generality, the fluctuation field is not only a function of the position vector 
n, but also of the time in which our measurements are taken. In practice, the time and 
displacement of measurements vary so slowly that we can ignore these dependences 
altogether. Therefore, we can equally well consider this function as one defined only on 
the unit radius sphere S 2 , for which the following decomposition holds: 



Since the spherical harmonics Ye m (h) obey the symmetry Y e * m (n) = (— l) m Ye _ m (n), the 
fact that the temperature field is a real function implies the identity a* lm = (— l) m a^_ m . 
This means that each temperature multipole £ is completely characterized by 2£ + 1 
real degrees of freedom. 

2.1 From inhomogeneities to anisotropies: linear theory 

The ultimate source of anisotropies in the Universe are the inhomogeneities in the 
baryon-photon fluid, as well as their associated spacetime metric fluctuations. If the 
photons were in perfect equilibrium with the baryons up to a sharply defined moment 
in time (the so-called instant recombination approximation), their distribution would 
have only one parameter (the equilibrium temperature at each point), so that photons 
flying off in any direction would have exactly the same energies. In that case, the 
photons we see today coming from a line-of-sight n would reflect simply the density 
and gravitational potentials (the "sources") at the position Rh, where R is the radius 
to that (instantaneous) last scattering surface. Evidently, multiple scatterings at the 
epoch of recombination, combined with the fact that anisotropies themselves act as 
sources for more anisotropies, complicate this picture, and in general the relationship 




o 



(2) 




(3) 
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of the sources with the anisotropics must be calculated from either a set of Einstein- 
Boltzmann equations or, equivalently, from the line-of-sight integral equations coupled 
with the Einstein, continuity and Euler equations [5]. 

Assuming for simplicity that recombination was instantaneous, at a time rjn, the 
linear kernels of Eq. Q reduce to Ki(x',rj';n) — » /3i5(r}' — t]r)5(x' — nR), where 
R = f]o — T]r and $ are constant coefficients. The photon distribution that we measure 
on Earth would therefore be given by: 

0(n) w Yl^^' = m ^ = ^) ■ W 

i 

We can also express this result in terms of the Fourier spectrum of the sources: 

J ^e ik -* R S% VR ) . (5) 

Now we can use what is usually referred to as "Rayleigh's expansion" (though Watson, 
in his classic book on Bessel functions, attributes this to Bauer, J. f. Math. LVI, 1859): 

e** = 4vr ]T i l 3l{kx) Y? m {k) Y lm {x) , (6) 

where je(z) are the spherical Bessel functions. Substituting Eq. ^ into Eq. ^ we 
obtain that: 

/r (fih 
d 2 hY; m (h)Q(n) « J -^9^) x 4rri E j e (kR)Y; m (k) , (7) 

where we have loosely collected the sources into the term 0(fc) = ^2, i fiiS' l (k,r]R). This 
expression conveys well the simple relation between the Fourier modes and the spherical 
harmonic modes. Therefore, up to coefficients which are known given some background 
cosmology, the statistical properties of the harmonic coefficients ag m are inherited from 
those of the Fourier modes Q(k) of the underlying matter and metric fields. Notice 
that the properties of the a&n's under rotations, on the other hand, have nothing to do 
with the statistical properties of the fluctuations: they come directly from the spherical 
harmonic functions Yi m . 



2.2 Statistics in Fourier space 

The characterization of the statistics of random variables is most commonly expressed 
in terms of the correlation functions. The two-point correlation function is the ensemble 
expectation value: 

C(k,k') = (Q(k)Q(k')} . (8) 

In the absence of any symmetries, this would be a generic function of the arguments k 
and k', with only two constraints: first, because 0(x) is a real function, Q*(k) = Q(—k), 
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hence in our definition C*(k,k') = C(—k,—k'); and second, due to the associative 
nature of the expectation value, C(k, k') = C(k', k). It is obvious how to generalize this 
definition to 3, 4 or an arbitrary number of fields at different A;'s (or "points".) 

Let us first discuss the issue of gaussianity. If we say that the variables Q(k) are 
Gaussian random numbers, then all the information that characterizes their distribution 
is contained in their two-point function C(k, k'). The probability distribution function 
(pdf) is then formally given by: 



P[Q(k),G(k')] ~exp 



e(fc)e(fc') 

2C(k,k') 



In this case, all higher-order correlation functions are either zero (for odd numbers of 
points) or they are simply connected to the two-point function by means of Wick's 
Theorem: 

N 

(e(h)Q(h) . . . e(4v)) G = E II BZiefaefc)) , (9) 

i,j a=l 

where the sum runs over all permutations of the pairs of wave vectors and Bij are 
weights. 

Second, let's consider the issue of homogeneity. A field is homogeneous if its ex- 
pectation values (or averages) do not dependent on the spatial points where they are 
evaluated. In terms of the iV-point functions in real space, we should have that: 

(Q(xi)6(f 2 ) • • • @0?at)) H °T g ' C N (xi - x 2 , ...,xjv_i - xn) ■ (10) 
Writing this expression in terms of the Fourier modes, we get that: 

(6(f x )e(f 2 )...e(^)> = / d * kl d * kN er^er^ . . . e -*»*» 

x (e(h)e(k 2 )...e(k N )) . (ii) 



Homogeneity demands that the expression in Eq. (11) is a function of the distances 
between spatial points only, not of the points themselves. Hence, the expectation 
value in Fourier space on the right-hand-side of this expression must be proportional 
to 8{k\ + k 2 + ■ ■ ■ + fcjv). In other words, the hypothesis of homogeneity constrains the 
iV-point function in Fourier space to be of the form: 

(e(h)G(k 2 ) . . . &(k N )) u = (2vr) 3 iV(fc 1 , k 2 ,..., k N ) 5(h +k 2 + ... + k N ). (12) 

Notice that the "(iV — l)-spectrum" in Fourier space, N, can still be a function of the 
directions of the wavenumbers ki (it will be, in fact, a function of iV — 1 such vectors, 
due to the global momentum conservation expressed by the 5-function.) Models which 
realize the general idea of Eq. ( 12 ) correspond to homogeneous but anisotropic universes 
|3Tl [3H [391 S0]- 
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There is a useful diagrammatic illustration for the iV-point functions in Fourier 



space that enforce homogeneity. Notice that we could use the 5-function in Eq. (12) 



to integrate out any one of the momenta ki in Eq. ( |ll[ ). Let us instead rewrite the 
5-functions in terms of triangles, so for the 4-point function we have: 

8(kt + k 2 + k 3 + k A ) = J d 3 q 5(h + k 2 - q) S(k 3 + k A + q) , (13) 

whereas for the 5-point function we have: 

5(h + k 2 + h + h + h) = / d 3 q d 3 q' 5{h + k 2 -q) 5(q + k 3 - q) 8{q' + h + h) , (14) 



and so on, so that the iV-point 5-function is reduced to iV — 2 triangles with N — 3 
"internal momenta" (the idea is nicely illustrated in Fig. 1.) Substituting the expression 



for the iV-point 5-function into Eq. Q 1 1 p and integrating out all external momenta but 
e first (ki) and last (fcjv 

(G(x 1 )e(x 2 )...G(x N )) 



the first (ki) and last (fcjv), the result is that: 

1 



(2tt) 3JV 

x (Q(k 1 )e(q 1 -k 1 )...Q(k N )) 



<i 3 /ci <i 3 gi . . . d 3 q N - 3 d 3 k? 

ik 1 -{x 1 -x 2 ) iqi-(x 2 -x s ) JqN-r{^N-2-S N -\) „ik N -{x N ^ 1 -x N ) 



(15) 



This expressions shows explicitly that the real-space iV-point function above does not 
depend on any particular spatial point, only on the intervals between points. 






Figure 1: Diagrammatic representation of the 2, 3, 4 and 5-point correlation functions 
in Fourier space. The dashed lines represent internal momenta. 



Finally, what are the constraints imposed on the iV-point functions that come from 
isotropy alone? Clearly, no dependence on the directions defined by the points, Xi — Xj, 
can arise in the final expression for the iV-point functions in real space, so from Eq. (11) 
we see that the iV-point function in Fourier space should depend only on the moduli of 
the wavenumbers - up to some momentum- conservation 5-functions, which naturally 
carry vector degrees of freedom. 

In this review we will mostly be concerned with tests of isotropy given homogeneity 
(but not necessarily Gaussianity), so in our case we will usually assume that the iV-point 
function in Fourier space assumes the form given in Eq. (12). 
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2.3 Statistics in harmonic space 

In the previous section we characterized the statistics of our field in Fourier space, which 
in most cases is most easily related to fundamental models such as inflation. Now we 
will change to harmonic representation, because that's what is most directly related to 
the observations of the CMB, G(n), which are taken over the unit sphere S 2 . We will 
discuss mostly the two-point function here, and we defer a fuller discussion of iV-point 
functions in harmonic space to Section 3. 

From Eq. ([7]) we can start by taking the two-point function in harmonic space, and 
computing it in terms of the two-point function in Fourier space: 

{A7Tfi\-if u{kR)j e {k'R) Y im (k)Y e 1 ml (k f ) (Q(k)e*(k')) . 

(16) 

Under the hypothesis of homogeneity, this expression simplifies considerably, leading 
to: 

(ai m a* t , ml ) K = J d 3 kU e (-iY' j e (kR)j e (kR)Y em (k)Y;, ml (k) x N 2 (k) . (17) 

If, in addition to homogeneity, we also assume isotropy, then N 2 — > P(k), and the 
integration over angles factors out, leading to the orthogonality condition for spherical 
harmonics: 

d 2 kY lm {k) Yff m ,{k) = 5& Sram' , 



and as a result the covariance of the a£ m 's becomes diagonal: 

/dk 2 
— j 2 (kR) -k 3 P(k) 
k 7T 



47r 8 U > b mm i j dlogk ji(kR) A^{k) 



= Ce 5u> <5 'mm' , 

where we have defined the usual temperature power spectrum Ar(fc) = k 3 P(k)/27r 2 in 
the middle line, and the angular power spectrum Ce in the last line of Eq. (18). As 
a pedagogical note, let's recall that the power spectrum basically expresses how much 
power the two-point correlation function has per unit log k: 

(6(f)e(f )> H ,i = J dlogk ^^^- AUk) . (19) 

In an analogous manner to what was done above, we can also construct the angular 
two-point correlation function in harmonic space: 

(e(n)e(n')} = J2T,< a ^ a t'rn') Y Un)YL(n') ■ (20) 

lm I'm 1 
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The hypothesis of homogeneity by itself does not lead to significant simplifications, but 
isotropy leads to a very intuitive expression for the angular two-point function: 



(e(n)e(n')) H) i = Y<12 Ci5 M 5 rnm'Yt m {n)Yl m {h') (21) 



Im I'm! 



^C,——P l {n-n). 



Clearly, not only is this expression the analogous in S 2 of Eq. (ph, but in fact the 
Fourier power spectrum A^(k) and the angular power spectrum Cg are defined in terms 



of each other as indicated in Eq. (21): 



d = 4vr J d\ogkjl{kR) A 2 T {k) . (22) 

Now, using the facts that the spherical Bessel function of order £ peaks when its argu- 
ment is approximately given by £, and that J dlogz j](z) = l/(2£(£ + 1)), we obtain 
thatQ 

c * w itrh) A2Ak = i/R) ■ (23) 

Incidentally, from this expression it is clear why it is customary to define: 

Ct = £ ^^-C i » A 2 T (k = i/R) . 



Using Eq. (11) we can easily generalize the results of this subsection to iV-point 
functions in S 2 and in harmonic space, however, the assumption of isotropy alone does 
very little to simplify our life. The hypothesis of homogeneity, on the other hand, 
greatly simplifies the angular iV-point functions, and most of the work in statistical 
anisotropy of the CMB that goes beyond the two-point function assumes that homo- 
geneity holds. Notice that the issue of gaussianity is, as always, confined to the question 
of whether or not the two-point function holds all information about the distribution 
of the relevant variables, and is therefore completely separated from questions about 
homogeneity and/or isotropy. 



Also notice that the separable nature of the definition (20) implies here as well, like 



in Fourier space, a reciprocity relation for the correlation function: 

C(n u n 2 )=C(h 2 ,n 1 ). (24) 

This symmetry must hold regardless of underlying models, and is important in order 
to analyze the symmetries of the correlation function, as we shall see later. 



Before we move on, it is perhaps important to mention that the decomposition (20) 



is not unique. In fact, instead of the angular momenta of the parts, (£i, mi; £2, ^2), we 

1 This is one type of what has become known in the astrophysics literature as Limber's approxima- 
tions. 
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could equally well have used the basis of total angular momentum (L,M; £±,£2) and 
decomposed that expression as: 



C{n lt n 2 ) = E (M) 
where are known as the bipolar spherical harmonics, defined by |41] : 



ymhi,h 2 ) = [Y h {h 1 )®Y h {n 2 )\ 



LM 



where L and M = m\ + m 2 are the eigenvalues of the total and azimuthal angular 
momentum operators, respectively. This decomposition is completely equivalent to Eq. 



(20), and we can exchange from one decomposition to another by using the relation: 



= E K mi ^ m2 )(-i) M+ ^ fe v / 2TTT ( * ^ _ L M ) , (26) 

where the 3x2 matrices above are the well-known 3-j coefficients. At this point, it 
is only a matter of mathematical convenience whether we choose to decompose the 



correlation function as in (20) or as in (25). Although the bipolar harmonics behave 



similarly to the usual spherical harmonics in many aspects, the modulations of the 
correlation function as described in this basis have a peculiar interpretation. We will 
not go further into detail about this decomposition here, as it is discussed at length in 
another review article in this volume. 



2.4 Estimators and cosmic variance 



Returning to the covariance matrix (18), we see that, if we assume gaussianity of the 
a^ m 's, then the angular power spectrum suffices to describe statistically how much the 
temperature fluctuates in any given angular scale; all we have to do is to calculate 



the average (18). This can be a problem, though, since we have only one Universe to 



measure, and therefore only one set of aim's. In other words, the average in (18) is 
poorly determined. 

At this point, the hypothesis that our Universe is spatially homogeneous and isotropic 
at cosmological scales comes not only as simplifying assumption about the spacetime 
symmetries, but also as a remedy to this unavoidable smallness of the working cos- 
mologist's sample space. If isotropy holds, different cosmological scales are statistically 
independent, which means that we can take advantage of the ergodic hypothesis and 
trade averaging over an ensemble for averaging over space. In other words, for a given 
£ we can consider each of the 2£ + 1 real numbers in ai m as statistically independent 
Gaussian random variables, and define a statistical estimator for their variances as the 
average: 

1 - 

Ci ^ 2£+ 1 E ' a ^ m ' 2 ' ( 27 ) 
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The smaller the angular scales (£ bigger), the larger the number of independent patches 
that the CMB sky can be divided into. Therefore, in this limit we should have: 



lim Ce = Ce 

t— >oo 



On the other hand, for large angular scales (small i's), the number of independent 



patches of our Universe becomes smaller, and (27) becomes a weak estimation of the 
C(S. This means that any statistical analysis of the Universe on large scales will be 
plagued by this intrinsic cosmic sample variance. Notice that this is an unavoidable 
limit as long as we have only one observable Universe. 

Finally, it is important to keep in mind the clear distinction between the angular 



power spectrum Ce and its estimator (27). The former is a theoretical variable which 



can be calculated from first principles, as we have shown in £2.1 The latter, being a 



function of the data, is itself a random variable. In fact, if the a£ m 's are Gaussian, then 



we can rewrite expression (27) as: 

(2£+l) - 1 12 



Ce — Xe , Xe — 



Ira 



m=— t 

where Xe is a chi-square random variable with 11 + 1 degrees of freedom. According to 
the central limit theorem, when i — > oo, Xe approaches a standard normal variabld^J 
which implies that Cg will itself follow a Gaussian distribution. Its mean can be easily 



calculated using (18) and (27), and is of course given by: 

(Ce) = Ce , 

which shows that the C/s are unbiased estimators of the C/s. It is also straightforward 
to calculate its variance (valid for any £): 

((Ce - Ce)(Ce> - G>)) = 2 £ Z \ ■ 

Because this estimator does not couple different cosmological scales, it has the minimum 
cosmic variance we can expect from an estimator due to the finiteness of our sample - 
so it is optimal in that sense. Ce is therefore the best estimator we can build to measure 
the statistical properties of the multipolar coefficients ae m when both statistical isotropy 
and gaussianity hold. 

In later Sections we will explore angular or harmonic iV-point functions for which 
the assumption of isotropy does not hold. However, it is important to remember at all 
times that we have only one map, which means one set of a£ m 's. The estimator for the 
angular power spectrum, Ct, takes into account all the a^ m 's by dividing them into the 
different €s and summing over all m e (—£,£). Clearly, it will inherit a sample variance 

2 A standard normal variable is a Gaussian variable X with zero mean and unit variance. Any other 
Gaussian variable Y with mean [i and variance a can be obtained from X through Y = aX + /i. 
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for small £'s, when the a£ m 's can only be divided into a few "independent parts". As 
we try to estimate higher-order objects such as the iV-point functions, we will have 
to subdivide the a^ m 's into smaller and smaller subsamples, which are not necessarily 
independent (in the statistical sense) of each other. So, the price to pay for aiming at 
higher-order statistics is a worsening of the cosmic sample variance. 



2.5 Correlation and statistical independence 



The covariance given in Eq. (18) has two distinct, important properties. First, note 
that its diagonal entries, the Cg's, are m-independent coefficients; this is crucial for 
having statistical isotropy, as we will show latter. Second, statistical isotropy at the 
Gaussian level implies that different cosmological "scales" (understood here as meaning 
the modes with total angular momentum £ and azimuthal momentum m) should be 
statistically independent of each other - and this is represented by the Kronecker deltas 



in (18). 



In fact, statistical independence of cosmological scales is a particular property of 
Gaussian and statistically isotropic random fields, and is not guaranteed to hold when 
gaussianity is relaxed. We will see in the next Section that the rotationally invariant 
3-point correlation function (and in general any N > 2 correlation function) couples to 
the three scales involved. In particular, if it happens that the Gaussian contribution of 



the temperature field is given by (18), but at least one of its non-Gaussian moments are 
nonzero, then the fact that a particular correlation is zero, like for example (a2mi03 m2 ), 
does not imply that the scales 1 = 2 and I = 3 are (statistically) independent. This 
is just a restatement of the fact that, while statistical independence implies null cor- 
relation, the opposite is not necessarily true. This can be illustrated by the following 
example: consider a random variable a distributed as: 



P(a) 



1 a G [0, 1] 
otherwise . 



Let us now define two other variables x = cos(27ra) and y = sin(27ra). From these 
definitions, it follows that x and y are statistically dependent variables, since knowledge 
of the mean/variance of x automatically gives the mean/variance of y. However, these 
variables are clearly uncorrelated: 



1 f 27r 

(xy) = — / cos r] sin rjdr] = . 
2tt J 



Although correlations are among cosmologist's most popular tools when analyzing CMB 
properties, statistical independence may turn out to be an important property as well, 
specially at large angular scales, where cosmic variance is more of a critical issue. 
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3 Beyond the standard statistical model 



Until now we have been analyzing the properties of Gaussian and statistically isotropic 
random temperature fluctuations. This gives us a fairly good statistical description 
of the Universe in its linear regime, as confirmed by the astonishing success of the 
ACDM model. This picture is incomplete though, and we have good reasons to search 
for deviations of either gaussianity and/or statistical isotropy. For example, the ob- 
served clustering of matter in galactic environments certainly goes beyond the linear 
regime where the central limit theorem can be applied, therefore leading to large de- 
viations of gaussianity in the matter power spectrum statistics. Besides, deviations of 
the cosmological principle may leave an imprint in the statistical moments of cosmo- 
logical observables, which can be tested by searching for spatial inhomogeneities jl2] or 
directionalities [43j. 

But how do we plan to go beyond the standard model, given that there is only one 
Gaussian and statistically isotropic description of the Universe, but infinite possibili- 
ties otherwise? This is in fact an ambitious endeavor, which may strongly depend on 
observational and theoretical hints on the type of signatures we are looking for. In the 
absence of extra input, it is important to classify these signatures in a general scheme, 
differentiating those which are non-Gaussian from those which are anisotropic. Further- 
more, given that the signatures of non-gaussianity may in principle be quite different 
from that of statistical anisotropy, such a classification is crucial for data analysis, which 
requires sophisticated tools capable of separating these two issues^} 

We therefore start §3.1 by analyzing deviations of gaussianity when statistical isotropy 



holds. In {3.2 we keep the hypothesis of gaussianity and analyze the consequences of 



breaking statistical rotational invariance. 



3.1 Non- Gaussian and SI models 

3.1.1 Rotational invariance of iV-point correlation functions. 

We turn know to the question of non-Gaussian but statistically isotropic probabilities 
distributions. We will keep working with the iV-point correlation function defined in 
harmonic space, 

(a£ imi a£ 2m2 . . . a£ NmN ) (28) 

since knowledge of these functions enables one to fully reconstruct the CMB temper- 
ature probability distribution. Specifically, we would like to know the form of any 
iV-point correlation function which is invariant under arbitrary 3-dimensional spatial 
rotations. When rotated to a new (primed) coordinate system, the iV-point correlation 

3 Although gaussianity and homogeneity/isotropy are mathematically distinct properties, it is pos- 
sible for a Gaussian but inhomogeneous/anisotropic model to look like an isotropic and homogeneous 
non-gaussian model. See for example |44j . 
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function transforms as: 



(a^mi^mi • • • ae N m> N ) - ^2(at imi ae 2m2 . . . Vwl^mi^m, ■ ■ ■ D m' N m N > ( 29 ) 

allm 

where the D* ,(a, /3, j)'s are the coefficients of the Wigner rotation-matrix, which 

1 i 

depend on the three Euler-angles a, (3 and 7 characterizing the rotation. Notice that 
in this notation the primed (rotated) system is indicated by the primed m's. For the 
2-point correlation function, we have already seen that the well-known expression 

does the job: 

(at^at^) = C tl ( E(- 1 ) mi ^' imi ^- mi ) <V 2 

V mi / 
= (— l^CfrSi^Sm^ - m > 2 . 

Note the importance of the angular spectrum, Cg, being a m-independent function. 

What about the 3-point function? In this case, the invariant combination is found 
to be: 

\0-£ 1 m 1 a £ 2 m 2 a £31713 ) = B 

which can be verified by straightforward calculations. Again, the non-trivial physi- 
cal content of this statistical moment is contained in an arbitrary but otherwise m- 
independent function: the bispectrum B^ 2 ^ 3 [45, 46, 47J. As we anticipated in Section 
2, rotational invariance of the 3-point correlation is not enough to guarantee statistical 
independence of the three cosmological scales involved in the bispectrum, although in 
principle a particular model could be formulated to ensure that oc Si 1 e 2 Se 2 £ 3 , at 

least for some subset of a general geometric configuration of the 3-point correlation 
function. 

These general properties hold for all the iV-point correlation function. For the 4- 
point correlation function, for example, Hu [48] have found the following rotationally 
invariant combination 

(a, imi ...a, 4m4 ) = gg|g(L)(-l) M ^ 1 i ^ - M )(t 3 ™ 4 m ) 

where the Q^^(L) function is known as the trispectrum, and L is an internal angular 
momentum needed to ensure parity invariance. In a likewise manner, it can be verified 
that the following expression 

(a iimi . . . a 4m6 ) = Yl P Ws( L ' L ')( _1 ) M+M ( t 2 —M ) 

LM L'M> V ' 

/ £ 3 U L> \f 4 L V \ 
\ m 3 m 4 —M' J \m 5 M M' ) 
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*1 ^2 ^3 

rrii m 2 m 3 



gives the rotationally invariant quadrispectrum P^ 2 £r (L, L') 






Figure 2: Diagrammatic representation of the 2, 3, 4 and 5-point correlation functions 
in harmonic space. Here I actually represents the pair (£,m). 



The examples above should be enough to show how the general structure of these func- 
tions emerges under SI: apart from a m-independent function, every pair of momenta 
£i in these functions are connected by a triangle, which in turn connects itself to other 
triangles through internal momenta when more than 3 scales are present. In Fig. [2] we 
show some diagrams representing the functions above. 

Although we have always shown iV-point functions which are rotationally invariant, 
the procedure used for obtaining them was rather intuitive, and therefore does not offer 
a recipe for constructing general invariant correlation functions. Furthermore, it does 
not guarantee that this procedure can be extended for arbitrary iV's. Here we will 
present a recipe for doing that, which also guarantees the uniqueness of the solution. 

The general recipe for obtaining the rotationally invariant iV-point function is as 



follows: from the expression (29) above, we start by contracting every pairs of Wigner 



functions, where by "contracting" we mean using the identity 



D 1 , (tu)D 12 , (u) 



E 



mi rrin 



L,M,M' 

x(2L + l)(-r 



L 

-M 



mi 



r-2 
m 2 



L 

-M' 



M+M' n L 



and where u = {a,/3,7} is a shortcut notation for the three Euler angles. Once this 
contraction is done there will remain [iV/2] D-functions, which can again be contracted 
in pairs. This procedure should be repeated until there is only one Wigner function 
left, in which case we will have an expression of the following form: 



l\m\ 



...at 



) 



all m' 



(a £im j . . . a iNm ' N ) x ^geometrical factors x D 



L 

MM' 



Now, we see that the only way for this combination to be rotationally invariant is 
when the remaining D MM , function above does not depend on u, i.e., D MM ,(u>) = 
5lq5mo^m'o- Once this identity is applied to the geometrical factors, we are done, and 
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the remaining terms inside the primed m-summation will give the rotationally invariant 
(iV-l)-spectrum. 

As an illustration of this algorithm, let us construct the rotationally invariant spec- 
trum and bispectrum. For the 2-point function there is only one contraction to be done, 
and after we simplify the last Wigner function we arrive at 



(ah rn i 

(If 



2«1-2 / 



E 



(Kim; I 2 ) 

2£ 1 + 1 



•l) m2 ^ife^mi,-m 2 j 



where, of course: 



C, 



2£ + 



^tE<i 



is the well-known definition of the temperature angular spectrum. For the 3-point 
function there are two contractions, and the simplification of the last Wigner function 
gives 



(at! mi 



3TO3/ 



E 



rn , ,rn:, . rn .. 



m\ m 2 m. 



*1 *2 *3 

mi m 2 m 3 



From this expression and the ortoghonality of the 3-j's symbols (see the Appendix), we 
can immediately identify the definition of the bispectrum: 



B 



mi,m2,m 3 



mi Oj £ 2 m2 %iri3 / 



*1 

mi 



*2 

m 2 



^3 

m 3 



It should be mentioned that this recipe not only enables us to establish the rotational 
invariance of any iV-point correlation function, but it also furnishes a straightforward 
definition of unbiased estimators for the iV-point functions. All we have to do is to 
drop the ensemble average of the primed a£ m 's. So, for example, for the 2- and 3-point 
functions above, the unbiased estimators are given respectively by: 



Co 



2£ + 



-E 



a ern a £in 



B 



E 



mi,m 2 ,m3 



^1 

m x 



^2 

m 2 



^3 

m 3 



Notice that isotropy plays the same role, in S 2 , that homogeneity plays in IR 3 . What 
enforces homogeneity in IR 3 is the Fourier-space 5-functions, as in the discussion around 
Fig. 1. However, in S 2 the equivalent of the Fourier modes are the harmonic modes, 
for which there is only a discrete notion of orthogonality - and no Dirac 5-function. 
What we found above is that the Wigner 3-j symbols play the same role as the Fourier 
space (^-functions: they are the enforcers of isotropy (rotational invariance) for the 
TV-point angular correlation function. Hence, the diagrammatic representations of the 
constituents of the iV-point functions in Fourier (Fig. 1) and in harmonic space (Fig. 
2) really do convey the same physical idea - one in IR 3 , the other in S 2 . 
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3.2 Gaussian and Statistically Anisotropic models 

In the last section, we have developed an algorithm which enables one to establish the 
rotational invariance of any iV-point correlation function. As we have shown, this is 
also an algorithm for building unbiased estimators of non-Gaussian correlations. In this 
section we will change the perspective and analyze the case of Gaussian but statistically 
anisotropic models of the Universe. 

There are many ways in which statistical anisotropy may be manifested in CMB. 
From a fundamental perspective, a short phase of inflation which produces just enough 
e-folds to solve the standard Big Bang problems may leave imprints on the largest 
scales of the Universe, provided that the spacetime is sufficiently anisotropic at the 
onset of inflation [39J. Another source of anisotropy may result from our inability 
to efficiently clean foreground contaminations from temperature maps. Usually, the 
cleaning procedure involves the application of a mask function in order to eliminate 
contaminations of the galactic plane from raw data. As a consequence, this procedure 
may either induce, as well as hide some anomalies in CMB maps |28j . 

It is important to mention that these two examples can be perfectly treated as 
Gaussian: in the first case, the anisotropy of the spacetime can be established in the 
linear regime of perturbation theory, and therefore will not destroy gaussianity of the 
quantum modes, provided that they are initially Gaussian. In the second case, the mask 
acts linearly over the temperature maps, therefore preserving its probability distribution 



3.2.1 Primordial anisotropy 

Recently, there have been many attempts to test the isotropy of the primordial Universe 
through the signatures of an anisotropic inflationary phase |38 | l39l H0~t |50~1 I5T] . A generic 
prediction of such models is the linear coupling of the scalar, vector and tensor modes 
through the spatial shear, which is in turn induced by anisotropy of the spacetime [38J. 
Whenever that happens, the matter power spectrum, defined in a similar way as in Eq. 



(12), will acquire a directionality dependence due to this type of see-saw mechanism. 



This dependence can be accommodated in a harmonic expansion of the form: 

P{k) = Y,n m {k)Y tm {k) , (30) 

— * 

where the reality of P(k) requires that r£ m (k) = (— l) m rj _ m (k). Given that temperature 
perturbations Q(x) are real, their Fourier components must satisfy the relation Q(k) = 



Q*(—k). This property taken together with the definition (12) implies that: 



P(k) = P(-k) , (31) 



which in turn restricts the £'s in (30) to even values. Also, note that by relaxing the 



assumption of spatial isotropy, we are only breaking a continuous spacetime symmetry, 
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but discrete symmetries such as parity should still be present in this class of models. 
Indeed, by imposing invariance of the spectrum under the transformation z — > —z, 
we find that (— l)^~ m = 1. Similarly, invariance under the transformations x — > —x 
and y — > —y imply the conditions r^ m = (— l) m r^_ m and r^ m = r^_ m , respectively. 
Gathering all these constraints with the parity of £, we conclude that 

r im 6 1, £, m e 2N . (32) 

That is, from the initial 2^ + 1 degrees of freedom, only 1/2 + 1 of them contribute to 
the anisotropic spectrum [39J. 

3.2.2 Signatures of statistical anisotropies 



The selection rules (32) are the most generic predictions we can expect from models 
with global break of anisotropy. We will now work out the consequences of these rules 
to the temperature power spectrum, and check whether they can say something about 
the CMB large scale anomalies. 



From the expressions (16) and (30) we can immediately calculate the most general 



anisotropic covariance matrix [37J: 



where: 



\ _ \ p.ll&h Tjhm,- A /oq\ 
/ — /—I y m 1 m 2 m 3 - n e 1 e 2 ' V°°/ 



,Wa _/ 1 vn l ./(2^i + l)(24 + l)(2f 3 + l) ( l x £ 2 £s\f £1 £2 £3 



Ql\t2tZ _ \rai 

^m im2 m 3 47T V 000 7V _m l m 2 m 3 

(34) 

are the Gaunt coefficients resulting from the integral of three spherical harmonics (see 
the Appendix). These coefficients are zero unless the following conditions are met: 

h + £2 + £3 e 2N 

nil + rri2 + = (35) 

\£i -£ 1 \<£ k <£i + tj Vi, j, k g {l, 2, 3} . 



The remaining coefficients in (33) are given by: 



POO 

Hi™ = Am£l ' i2 / d lo § k r ^m 3 (k) Jh (kR)j £2 (kR) 
Jo 



and correspond to the anisotropic generalization of temperature power spectrum (18) 



The selection rules (35), taken together with (32), lead to important signatures in 
the CMB. In particular, since £3 is even, the quantity £\ ± £2 must also be even, i.e., 
multipoles with different parity do not couple to each other in this class of models: 

(a iimi a* l2rn2 ) = , £1 + £ 2 = odd . (36) 
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This result is in fact expected on theoretical grounds, because by breaking a continuous 
symmetry (isotropy) we cannot expect to generate a discrete signature (parity) in CMB. 
However, notice that the absence of correlations between, say, the quadrupole and the 
octupole, does not imply that there will be no alignment between them. One example 
of this would be a covariance matrix of the form C^ m 8^8 mm i. If the C^q happen to be 
zero, for example, then all multipoles will present a preferred direction (in this case, 
the z-axis.) 



3.2.3 Isotropic signature of statistical anisotropy 

We have just shown that a generic consequence of an early anisotropic phase of the 
Universe is the generation of even-parity signatures in CMB maps. Interestingly, these 
signatures may be present even in the isotropic angular spectrum, since the C/s acquire 
some additional modulations in the presence of statistical anisotropics. In principle, 
these modulations could be constrained by measuring an effective angular spectrum of 
the form 

(a im a* £m ) = + £mf-m,o-^«° > (37) 

where we have introduced a small e parameter to quantify the amount of primordial 
anisotropy. 

In order to constrain these modulations we have to build a statistical estimator for 
(a£ m a} m ). Since we are looking for the diagonal entries of the matrix (33), a first guess 
would be 



r* eS — ! 

" 2£ + l 2s 



(38) 



m=— I 



To check whether this is an unbiased estimator of the effective angular spectrum, we 
apply it to (33) and take its average: 

e'>o,m 



Using the definition (34) of the Gaunt coefficients, the m-summation in the expression 
above becomes 



\£— m 



ni 



-m 



VWTl5 e/ o = 



where the last equality follows because £' > 0. Consequently we conclude that 

(Cf) = C e ^ (a em ag m ) . 

At first sight, this result may seem innocuous, showing only that this is not an appro- 
priate estimator for (37). Note however that (38) is in fact the estimator of the angular 
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spectrum usually applied to CMB data under the assumption of statistical isotropy. 
In other words, by means of the usual procedure we may be neglecting important in- 
formation about statistical anisotropy. Moreover, the cosmic variance induced by the 
application of this estimator on anisotropic CMB maps is small, because, as it can easily 
be checked: 

((Of - C t )(Cf - C e )) = ^ C\ 6 U , + 0(e 2 ) . 

This result shows that the construction of statistical estimators strongly depends on 
our prejudices about what non-gaussianity and statistical anisotropy should look like. 
Consequently, an estimator built to measure one particular property of the CMB may 
equally well hide other important signatures. One possible solution to this problem is 
to let the construction of our estimators be based on what the observations seem to tell 
us, as we will do in the next Section. 
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Part II 

Cosmological Tests 

So much for mathematical formalism. We will now turn to the question of how the 
hypotheses of gaussianity and statistical isotropy of the Universe can be tested. Though 
we are primarily interested in applying these tests to CMB temperature maps, most 
of the tools we shall be dealing with can be applied to polarization {E- and i?-mode 
maps) and to catalogs of large-scale structures as well. 

Testing the gaussianity and SI of our Universe is a difficult task. Specially because, 
as we have seen, there is only one Gaussian and SI Universe, but infinitely many uni- 
verses which are neither Gaussian nor isotropic. So what type of non-gaussianity and 
statistical anisotropy should we test for? In order to attack this problem we can follow 
two different routes. In the bottom-up approach, models for the cosmic evolution are 
formulated in such a way as to account for some specific deviations from gaussianity and 
SI. These physical principles range from non-trivial cosmic topologies (52J [53j El], pri- 
mordial magnetic fields [S5j M, E7J EH EH] , local |60j US EH [62] and global H2 M, M, EQ] 
manifestation of anisotropy, to non-minimal inflationary models [5T| [63| Ell E5j 166], |6"T] . 
The main advantage of the bottom-up approach is that we know exactly what feature 
of the CMB is being measured. One of its drawbacks is the plethora of different models 
and mechanisms that can be tested. 

The second possibility is the top-down, or model-independent approach. Here, we 
are not concerned with the mechanisms responsible for deviations of gaussianity or SI, 
but rather with the qualitative features of any such deviation. Once these features are 
understood, we can use them as a guide for model building. Examples here include 
constructs of a posteriori statistics [29| [351 136] and functional modifications of the two- 
point correlation function [281 E3 EEl EH EQl [H]. 

In the next section we will explore two different model-independent tests: one based 
on functional modifications of the two-point correlation function, and another one based 
on the so-called Maxwell's multipole vectors. 

4 Multipole vectors 

Multipole vectors were first introduced in cosmology by Copi et al. [23] as a new 
mathematical representation for the primordial temperature field, where each of its 
multipoles I are represented by I unit real vectors. Later it was realized that this idea 
is in fact much older [72], being proposed originally by J. C. Maxwell in his Treatise on 
Electricity and Magnetism. 

The power of this approach is that the multipole vectors can be entirely calculated 
in terms of a temperature map, without any reference to external reference frames. 
This make them ideal tools to test the morphology of CMB maps, like the quadrupole- 
octupole alignment. 
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The purpose of the following presentation is only comprehensiveness. A mathemat- 
ically rigorous introduction to the subject can be found in references |73| I72| 174"] . as 
well as other review articles in this review volume. 



4.1 Maxwell's representation of harmonic functions 

We start our presentation of the multipole vectors by recalling some terminology. A 
harmonic function in three dimensions is any (twice differentiable) function h that 
satisfies Laplace's equation: 

V 2 h = , (39) 

where V 2 is the Laplace operator. In spherical coordinates, the formal solution to 
Laplace's equation which is regular at the origin (r = 0) is: 

oo i 

h = ^2h e (r,9,v) , h e = ^ a em r e Y em (9,p) ■ (40) 

1=0 m=-l 

The functions r Yi m are known as the solid spherical harmonics [H]. Since they agree 
with the usual spherical harmonics on the unit sphere, it is sometimes stated in the liter- 
ature that the latter form a set of harmonic functions. This is an abuse of nomenclature 
though, and the reader should be careful. 

Given the scalar nature of Laplace's operator, it is possible to find solutions to 



Eq.(39) in terms of Cartesian coordinates. Such solutions can be constructed by com- 
bining homogeneous polynomial^] of order £: 

oo e 
h = ^2h t {x,y,z) , h l = ^2\ abc x a y b z c , {a + b + c = i) . (41) 

i=0 abc 

In three dimensions, the most general homogeneous polynomial of order £ contains 
(£ + 2)\/(2\£\) independent coefficients. However, since each polynomial must indepen- 



dently satisfy Eq. (39), precisely £\/(2\(£ — 2)\) of these coefficients will depend on each 
other. This constraint leaves us with {£ + 1){£ + 2)/2 — £{£ — l)/2 = 2£ + 1 indepen- 
dent degrees of freedom in each multipole i - which is, of course, the same number of 



independent degrees of freedom appearing in Eq. (40). 

Based on this analysis, Maxwell introduced his own representation of harmonic 
functions. He noticed that by successively applying directional derivatives of the form 
v ■ V = V{? over the monopole potential 1/r, where r = a/x 2 + y 2 + z 2 and v is a unit 



vector, he could construct solutions of the form (41). That is: 

1 



f e (x, y, z) — X e Vfy . . . V^V^ 

r 



(42) 

r=l 



4 A homogeneous polynomial is a sum of monomials, all of the same order. 
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where Xi are real constants. We are now going to show that this construction does indeed 
lead to solutions of the form (41). First, note that there is a pattern which emerges 
from successive application of directional derivatives over the monopole function: 



—V\ ■ r 

3(^2 • r)(vi ■ r) - r 2 (i/i ■ y 2 ) ^ 

and so on. By induction, one can show that the general expression will be given by: 

, (-1)^(21-1)!! nti ^+r 2 Q,_ 2 

h = ^e+i ' t 44 J 

where Qe-2 is a homogeneous polynomial of order £—2 which only involves combinations 
of the vectors V; L and r. 

The numerator of the function fi given by (44) is clearly a homogeneous polyno- 
mial of order I (as one can easily check for some Ps and also prove by mathematical 
induction.) A not so obvious result is that this polynomial is also harmonic. To prove 
that, let us define: 

i 

g e = (-1) £ (2£ - 1)!! ■ r + r 2 Q e _ 2 , (45) 

i=l 

and consider the application of the operator V 2 over the combination r a g£. For the 
^-component we get: 

d 2 x {r a g f ) = r a d 2 x g e + 2ar a - 2 xd x g e + [ar a ~ 2 + a(a - 2)x 2 r a - 4 ]g e . 

Repeating this process for the y- and z-components and then adding the results, we 
find: 

V 2 (r» = r a V 2 g e + 2ar a - 2 (xd x g e + yd y g e + zd z ge) + a(a + l)r a - 2 g e 
= r a V 2 g t + a{a + 2Z+l){r a - 2 g t ) 1 

where in the last step we have used Euler's theorem on homogeneous functions, i.e., 
f ■ Vge = £ge. If we now choose a = —(2i +1), we find immediately that 

V 2 ( 9i \ ^ 2 9e 



r 2l+l J r 2l+l ' 



By construction, the left-hand side of the above expression is equal to V 2 fi. But 
according to the definition (42) this quantity is also zero, since Laplace's operator 
commutes with directional derivatives and V 2 (l/r) =0 for r > 0. Therefore, V 2 ge = 
and g£ is harmonic, which completes our proof. 
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In conclusion, Maxwell's construction of harmonic functions, Eq. (42), is completely 
equivalent to the standard representation in terms of spherical harmonics. More im- 
portantly, this gives a one-to-one relationship between temperature maps (given by the 
G^ m 's) and I unit vectors Vi. This means that the multipole vectors can be directly 
calculated from a CMB map, without any reference to external reference frames or 
additional geometrical constructs. The reader interested in algorithms to construct the 
vectors from CMB maps may check references [23| 172] , as well as the other articles in 
this volume that review this approach. 



4.2 Multipole vectors statistics 

It should be clear from the discussion above that the multipole vectors give an intuitive 
way to discover, interpret and visualize phase correlations between different multipoles 
in the CMB maps. But they also can reveal intra-multipole features, such as planarity 
(when a given multipole presents a preferred plane.) Some of the most conspicuous 
hints of statistical anisotropy in the CMB, like the quadrupole-octupole alignment, 
have indeed been first found with less mathematically elegant methods [22, 27J, but are 
best described in terms of the multipole vector formalism [23, EU [26j [72]. However, 
this case should also sound an alarm, because some feature of a (presumably) random 
realization was found, then further scrutinized with a certain test which was, to some 
extent (intentionally or not) tailored to single out that very feature. 

The pitfalls of aprioristic approaches are sometimes unavoidable, and all we can do 
is take a second look at our sample with a more generic set of tools, to try to assess how 
significant our result really is in the context of a larger set of statistical tests. Multipole 
vectors are, in fact, ideally suited to this, since they can be found for all multipoles, 
and it is relatively easy to construct scalar combinations of these vectors with the usual 
methods of linear algebra. Also convenient is the fact that simulating these vectors 
from maps, or even directly, is also relatively easy, so the standard model of a Gaussian 
random field can be easily translated into the pdf 's for the tests constructed with the 
multipole vectors. An important drawback of the multipole vectors is that, because 
they are computed in terms of equations which are non-linear in the temperature fields, 
the distribution functions of statistical tests involving these vectors are highly non- 
Gaussian [75]. This means that these statistics usually have to be estimated by means of 
simulations (usually assuming that the underlying temperature field is itself Gaussian.) 
Another delicate issue is how stable the multipole vectors are to instrumental noise 
and sky cut - and here, again, we must rely on numerical simulations to compare the 
observations with theoretical models. 

In order to see how one should go about constructing a general set of tests, it 
should be noted first that the multipole vectors define directions, but they have no 
sense (they are "headless vectors".) This follows from the fact that the sign of the 
constants are degenerate with the sign of the vectors v^ p (p = 1, . . . , £ .) Hence, the 
first requirement is that our tests should be independent of this sign ambiguity. Notice 
that this ambiguity extends to ancillary constructs such as the normal vectors, defined 
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as the vector product: 

wippi = n, p <g> V£, p > . (46) 

Mathematically, the sign ambiguity implies that the multipole vectors belong to the 
quotient space S 2 /Z2 (also known as MP 2 ), while the normal vectors belong to IR 3 /Z,2. 

In principle, any positive-definite scalar constructed through the multipole or the 
normal vectors is "fair game", but there are some guidelines, e.g., one should avoid 
double-counting the same degrees of freedom. Below we review some of these tests, 
based on the work of Ref. |35j . 



4.2.1 The R statistic 

The first example of a test involving the multipole vectors would be a scalar product, 
such as v ■ v '. The most natural test would involve asking whether the £ multipole 
vectors of the given multipole £ are especially aligned or not. This means computing: 

2 - 

Ru = 1) l^'f ' > ( 47 ) 
p,p'>p 

where the normalization was introduced to make < Ru < 1. 

This idea could be generalized to test alignments between multipole vectors at dif- 
ferent multipoles: 

Ree = l^'P ' • ( 48 ) 

p,p' 

In fact, the quadrupole-octupole alignment can already be seen with this simple test: 
for essentially all CMB maps the significance of the alignment as measured by -R23 is of 
the order of 90-95% C.L. 



4.2.2 The S statistic 

The second most natural test does not involve directly the multipole vectors themselves, 
but the normal vectors that can be produced by taking the vector product between the 
multipole vectors. So, we take: 

wi tPP > = V£ >p <8> ve, P ' ■ (49) 

Notice that the number of normal vectors for a given multipole I is I — £(£ — l)/2 - 
so the number of normal vectors grows rapidly for larger multipoles, making it harder 
to use and meaning that the same degrees of freedom may be overcounted, at least for 
£ > 3. 

Again, the best strategy is simplicity: we can ask whether the normal vectors are 
aligned, within and between multipoles. This means computing: 

2 ' 

Su = 77] _ 7x 2^ K,P • &tj/\ , (so) 
p,p'>p 
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where the normalization was introduced again to make < Su < 1- 

And yet again this idea is easily generalized to test alignments between normal 
vectors at different multipoles: 

Sw = \ w£, P ■ wi', P >\ ■ (51) 

p,p' 

With this test, the quadrupole-octupole alignment is much more significant: for es- 
sentially all CMB maps the test S23 deviates from the expected range with 98% C.L. 

4.2.3 Other tests with multipole vectors 

One can go on and expand the types of tests using both multipole and normal vectors. 
One idea would be, e.g., to disregard the moduli of the normal vectors: 

p,p' 

This test is therefore insensitive to the relative angle between the multipole vectors 
that produce any given normal vectors. This idea is similar to the planar modulations 
that will be discussed in the next Section. With this test, the quadrupole-octupole 
alignment is significant to about 95-98% C.L. This test can also be generalized to a 
self-alignment test [i = £'), with just an adjustment to the normalization. 

Another possibility would be to measure the alignment between multipole vectors 
and normal vectors: ^ 

B tt> = jp 5^ ■ tivj I • ( 53 ) 
p,p' 

Of course, this test cannot be easily generalized to a self- alignment. With the B 2 3 test, 
the quadrupole-octupole alignment is significant to about 95-98% C.L. 

We could go on here, but it should be clear that all the information (the 11 + 1 real 
degrees of freedom for each multipole £) has already been exhausted in the tests above. 

5 Temperature correlation function 

Despite their strong cosmological appeal, the multipole vectors have some limitations. 
Not only their directions in the CMB sky are sometimes difficult to interpret physi- 
cally [26J, they also have the additional drawback of mixing, in a non-trivial manner, 
information on both gaussianity and SI of the map being analyzed [26, 75J. 

Another way of quantifying deviations in the standard statistical framework of cos- 
mology is through functional modifications of the two-point correlation function [37, 68j. 
Although this approach does not offer an optimal separation between gaussianity and 
SI (which is, by the way, an open problem in this field), working with the two-point 
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correlation function makes it easier to test Gaussian models of statistical anisotropy 

pHEE]. 



The most general 2-point correlation function (2pcf ) of two independent unit vectors 
is a function C of the form 

C : S 2 x S 2 ^R. 



We have seen in £2.3 that, if we choose spherical coordinates (9i,(pi) to describe each 
vector hi, the function above can be decomposed either in terms of two spherical har- 
monics Yi- m . or in terms of the bipolar spherical harmonic yf^f 2 - In any case, the 2pcf 



will have the following functional dependence 



c = c(e 1 ,< Pl ,e 2 ,( P2 ) 



(54) 



This function is absolutely general. If the Universe has any cosmological deviation 
of isotropy, whatever it is, it can be described by the function above (see also Fig. (pi) 




Figure 3: Geometrical representation of the 2pcf in terms of two unit vectors. 



Unfortunately, this function will be of limited theoretical interest unless we have 
some hints on how to select its relevant degrees of freedom. This difficulty is in fact 
a general characteristic of model-independent tools, which at some stages forces us to 
rely on our theoretical prejudices about the statistical nature of the Universe in order 
to construct estimators of non-gaussianity and/or statistical anisotropy. Nonetheless, 



it is still possible to construct statistical estimators of anisotropy based on Eq.(54) 



For example, Hajian and Souradeep [68J have constructed an unbiased estimator Ki for 
this function based solely on the requirement that this estimator should be rotationally 
invariant. Although it is true that any statistically significant Kg > will point towards 
anisotropy, it is not clear what type of anisotropy is being detected by this estimator. 



We can still use Eq. (54) to search for deviations of SI if we restrict its domain to 



a smaller and non-trivial sub-domain. For example, we can take the vectors hi and h 2 
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to be the same, and expand a function of the form 



C = C{6 llVl ). (55) 

which is equivalent to C : S 2 — > R. This form of the 2pcf makes it ideal for searching 
for power multipole moments in CMB, once a suitable estimator is defined [37]. Un- 
fortunately, when we take hi = h% we are in fact considering a one-point correlation 
function, which by construction does not allows us to measure correlations between 
different points in the sky. 



It seems in principle that the functions (55) and (54) are the only possibilities besides 



the isotropic 2pcf Eq.(22). If not, what other combinations of the vectors hi and h 2 



can we consider? As a matter of fact, these two vectors are geometrical quantities 
intuitively bound to our notion of two-point correlation functions on the sphere. From 
this perspective they are not fundamental quantities. In fact, we can equally well 
represent the 2pcf by a disc living inside the unit sphere, as shown in Fig. 4 




Figure 4: Geometrical representation of the 2pcf in terms of a unit disc (or plane). 



In this representation, 6 is the angle between the vectors hi and 77-2, as usual. The 
normal to the plane, h, is represented by two spherical angles (0, $). Finally, there is an 
overall orientation (p of the disc around its unit vector which completes the four degrees 



of freedom contained in Eq.(54). We have found therefore another valid geometrical 



representation of the most general 2pcf: 



c = c{e,$,o,<p) 



(56) 



The main advantage of the above representation when compared to (54) is its 
straightforward geometrical interpretation. First, note that the angular separation 
9 of the isotropic 2pcf is trivially included in this definition, and do not need to be 
obtained as a consequence of rotational invariance (see the discussion in S 3.1.1 ) Sec- 



ond, by characterizing the correlation function in terms of the geometrical components 
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of a disc, we know exactly what are the degrees of freedom involved. This makes it 
easier to construct estimators of statistical anisotropy, alleviating the drawbacks of 
model-independent approaches mentioned above. 



5.1 Anisotropy through planarity 



An immediate application of the representation (56) is its use in the search for planar 
deviations of isotropy in CMB [7TJ [28]. Planar modulations of astrophysical origin 
may play an important role to the CMB morphology. One example is the role played 
by the galactic and ecliptic plane in the quadrupole-octupole/north-south anomalies 
|26| . Also, it is well known that our galactic plane is sensible source of foreground 
contamination in the construction of cleaned CMB maps. These hints indicate that 
CMB modulations induced by the disc in Fig. 4 is not only a mathematical possibility, 
but perhaps also a symmetry of cosmological relevance. 

Since we are primarily interested in measuring planar modulations of CMB, but 
including the usual angular modulation as the isotropic limit of the 2pcf, we can consider 



only the azimuthal average of Eq. (56): 



1 



2tt 



C^— / C(e,$,9,<p)dip. (57) 







The resulting function can be easily expand in terms of simple special functions as 

c(e, $, e) = E ^Jici m P/(cos 0)y Im (e, $) , i e 2N (58) 

l l,m ^ 

where the restriction on the /-mode results from the symmetry h\ h 2 [76]. The 
multipolar coefficients C l e m correspond to a generalization of the usual angular power 
spectrum C/s. In fact, they can be seen as the coefficients of a spherical harmonic 
decomposition of the function Ce(n), provided that this function suffers modulations as 
we sweep planes on the sphere. 

5.1.1 Angular-planar power spectrum 

Since we are restricting our analysis to the Gaussian framework, the set of coefficients 
C l £ m is all we need to characterize the two-point correlation function. However, the 
final product of CMB observations are temperature maps, and not correlation maps. 
What we need then is an algebraic relation between the multipolar coefficients C l ™ and 
the temperature coefficients a^ m defined in Eq.fl3J). At first sight, this relation could 



be obtained by equating expression (58) to its standard definition in Eq. (20), and 
then using the orthogonality of the special functions to isolate the C l e m, s in terms of the 
corn's. But for that to work we need the relation between the set of angles (0, $, 9) 
and (pi, 9 2 , tp?) which, depending on the reference frame we choose, is extremely 
complicated. Fortunately, all we need is the relation: 

hi - n 2 = cos 9 = cos 9 1 cos 9 2 + sin 9\ sin 9 2 cos(</?x — ip 2 ) , (59) 
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together with a suitable choice of our coordinate system. For example, we can use the 
invariance of the scalar product hi ■ h 2 and choose our coordinate system such that the 
disc of Fig. 4 lie in the xy plane. With this choice we will have: 



(9,$) = (0,0) 



cos 8 = cos(<£>i — (f2) 



and the integration over 9 becomes simple. Once this is done, we make a passive 
rotation of the coordinate system and then we integrate over the remaining angles 
and $, which will then be given precisely by the Euler angles used in the rotation. The 
details are rather technical and can be found in the Appendix. The final expression is: 



mnlm 



V21 + T 



£i,mi £2,m.2 



1112 



a £21712) 
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mi 



t2 

m 2 



T l,£ 



(60) 



where 



-^2 = 5Z( -1 ) m '^ ?im '^ 2m ( 



I li 
m 



*2 

—m 



d(— cos 6) Pg(cos 6)e 



vrnS 



(61) 



and where the A^ m 's form a set of coefficients resulting from the 9 integration, which 
are zero unless 1^ + m = even (see the Appendix for more details). 



Expression (60) is what we were looking for. With this relation, the angular-planar 
power spectrum Cg" 1 can be calculated from first principles for any model predicting a 



specific covariance matrix. Moreover, since the angular-planar function (58) is, after all 



a correlation function, it should be possible to relate the angular-planar power spectrum 
C 1 / 11 to the bipolar power spectrum A.f^f of Hajian and Souradeep. In fact, by inverting 



expression (26) and plugging the result in (60), we find a linear relation between these 



two set of coefficients: 



c 



Irn 



lm 1 



{21 + I) III 



i.t 



Here, the set of geometrical coefficients 1^ plays a similar role to the 3-j symbols 
in expression (26). Note also the angular-planar power spectrum has only three free 



indices, while the bipolar power spectrum has four. This is a consequence of the az- 



imuthal average we took in (57), which further constrains the degrees of freedom of the 
correlation function. 



5.1.2 Statistical estimators and x 2 analysis 



We have shown that the angular-planar power spectrum (60) is given in terms of an 
ensemble average of temperature maps. Evidently, we cannot calculate it directly from 
data, for we have only one CMB map (the one taken from our own Universe.) The best 
we can do is to estimate the statistical properties of (60), like its mean and variance, 
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and see whether these quantities agree, in the statistical sense, with what we would 
expect to obtain from a particular model of the Universe. 

The reader should note that this procedure is not new - its limitation is due to the 
same cosmic variance which lead us to construct an estimator for the angular power 



spectrum Cg (see the discussion of 62.4). For the same reason, we will need to construct 



an estimator for the angular-planar power spectrum. An obvious choice is: 

C e lm ee 2VTV/27TT £ £ a eimi a hm2 ( ^ * ^ ) , (62) 

il,mi £2, ni2 

for in this case we have an unbiased estimator, 

lf> lm\ nlm 

1 — > 

regardless of the underlying model. 

If we now have a model predicting the angular-planar power spectrum, we can ask 



how good this model fit the observational data once it is calculated using (62). All we 
need is a simple chi-square (\ 2 ) goodness-of-fit test, which in our case can be written 
in the following generalized form: 

^ \f> Im Qlm\2 

(xlYi = E 1 ( j ' , (63) 

m=— I t 

in which I and I are the angular and planar degrees of freedom, respectively, and where 
a l e m is just the standard deviation of the estimator C e lm . The (21 + 1) _1 factor accounts 
for the 21 + 1 planar degrees of freedom, and was introduced for latter convenience. 

In §[7] we will apply this test to the 5- year WMAP temperature maps in order to 
check the robustness of the ACDM model against the hypothesis of SI. Before that, we 
shall stop and digress a little about how observational uncertainties should be included 
in our analysis. 
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6 Theory v. Observations: cosmic and statistical vari- 
ances 



Until now we have been concerned with the formal aspects of non-Gaussian and sta- 
tistically anisotropic universes, and how model-independent tests might be designed to 
detect deviations of either gaussianity or statistical isotropy. We will now discuss how 
such tests can be carried out and interpreted once we possess cosmological data. 

In a great variety of tests, statistical tools are designed to detect particular devia- 
tions of gaussianity or SI from cosmological data like, for example, the CMB. The final 
outcome of these tests is usually a probability (a pure number), which should be inter- 
preted as the chance that a Universe like ours might result from an ensemble of "equally 
prepared" Gaussian and SI universes. An anomaly in CMB is usually understood as a 
measure of how unlikely is a particular feature of our Universe according to this specific 
test. The multipole vector statistics, for example, when applied to a large number of 
simulated (Gaussian and SI) CMB maps, show that only ~ 0.01% of these maps have 
a quadrupole-octupole alignment as strong as WMAP maps [26, 23J. 

There are two points to keep in mind when carrying this type of analysis. The first 
is that a particular "detection" may always turn out to be a statistical fluctuation re- 
vealed by one specific tool. The robustness of an anomaly then depend on the number 
of independent tests pointing to the same result. The second is that the implementation 
of statistical tools is sensitive to the way we extract information from data, requiring an 
accurate separation between cosmological signal and astrophysical/instrumental noise. 

In this section we present a critical review of the standard procedure used to imple- 
ment cosmological tests. We show that it does not account for the intrinsic uncertainties 
of cosmological observations, which may possibly lead to an under/over estimation of 
anomalies. We then present a generalization of this process which naturally accounts 
for these uncertainties. 

6.1 Standard Calculations 

Suppose a; is a random variable predicted by a particular modeQ and that cosmological 
observations of this quantity returned the value xq. We would like to calculate the 
probability, according to this model, that in a random Universe we would have x < Xq. 
Assuming that P t h is the (normalized) probability density function (pdf) of x, this 
probability is commonly defined to be 



° Rigorously, x is only one realization of a random variable X, which is a real-valued function defined 
on a sample space. By the same reason we should not call the ae m 's in Eq. ^ random variables, 
though we shall stick to this nomenclature throughout this text. 




(64) 
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The probability of having x > Xq is then simply given by P> = 1 — P< . See the figure 
below. 




X Q 



Figure 5: Probability density function for the theoretical variable x. The shaded are is 



the probability Eq. (64). 



If P< is found to be too small or, equivalently, too high, we might be tempted 
to interpret xq as "anomalous" according to this model. However, this definition of 
probability assumes that xq was measured with infinite precision, and so it says nothing 
about an important question we must deal with: typically, the measurement of xq has 
an uncertainty which needs to be folded into the final probabilities that the observations 
match the theoretical expectations. 

Moreover, since no two equally prepared experiments will ever return the same value 
Xq, our measurements should also be regarded as random events. In a more rigorous 
approach, we would have to consider xq itself as one realization of a random variable, 
conditioned to the distribution of the signal. In the case of CMB, however, this would 
be only part of the whole picture, since the randomness of the measurements of Xq 
should also be related to the way this data is reduced to its final form. This happens 
because different map cleaning procedures will lead to slightly different values for xq. 
This difference induces a variance in the data which reflects the remaining foreground 
contamination of the temperature map. We will elaborate more on this point through 



a concrete example, after we show how Eq. (64) may be changed in order to include 



the indeterminacy of cosmological measurements. 



6.2 Convolving probabilities 

The question we want to answer is: how to calculate the probability of x being smaller 
than our measurements when the latter are also random events? Let us suppose that 
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our measurement is described by the random variable y and that xq is its most probable 
value. The probability of having x < y is simply the probability of having 

z = x — y < 

for some particular realization of the variable y. It should be clear by now that if we 
know the pdf of z, the probability we are looking for is simply the area under this 
distribution for — oo < z < 0. The probability P< of z being smaller or equal to zero 
can be calculated as: 

P< = P{{x,y)\x - y < 0} = JJ V(x,y)dxdy. 

x—y <0 

Now, under the hypothesis of independence of x and y we have V(x, y) = Vth{x)V bs{y) , 
where V b s is the (normalized) probability distribution function of the variable y. We 
can therefore rewrite the last expression as 

V ohs {y)dy. 

If we now hold y fixed and do the change of variables x = u + y, we get 

-o r roo 





POO 


ry 


p< = 




J V t h{x) dx 




' — oo 


_J — oo 



V th {u + y)V ohs {y) dy 



du . 



where we have changed the position of the integrals. The reader will now notice that 
term inside brackets is precisely the pdf we were looking for. Since there is nothing 
special about the variable y, we can equally well hold x fixed and repeat the calculus, 
obtaining the symmetric version of this result. In fact, the final pdf for z is nothing 
else than the convolution of the pdf 's of each variable |77| : 



V(z) = (P obs * V th ) (z) = / V ohs (y)V th (z±y)dy 

J — oo 

/oo 
V ^[x =F z)V t h{x) dx . (65) 
-oo 

where the plus (minus) sign refers to the difference (sum) of x and y. Integrating this 
pdf from (— oo, 0] we get our answer 



P< 



V{z)dz. 



(66) 



As a consistency check, notice that in the limit where observations are made with 
infinite precision, V ha{v) becomes a delta function and we have: 



P< 



6(y- x )Vth(z ± y)dydz 



— CO J — oo 



±x 



V t h{x)dx 
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which agrees with our previous definition. The reader must be careful, though, not to 
think of Eq. (64) as some lower bound to Eq. (66). Since none of the pdf's appearing 
in Eq. (65) are necessarily symmetric, a large distance from xq to the most probable 
(not the mean!) value x would not, by itself, constitute sufficient grounds to claim that 
the measured value of this observable is "unusual" in any sense, simply because a large 
overlap between the two pdf's can render the result usual according to Eq. (66). 

To illustrate this point, let us calculate P< using Eq. (64) and Eq. (66) for the 
pdf's which appear in Fig. 6. For pedagogical reasons, we have chosen Vth and V a bs as 
positively (Maxwell distribution) and negatively (Gumbel distribution) skewed pdf's, 
respectively. The convolved distribution appears as the solid (black) line. For these 
pdf's, Eq. (64) gives 99.2%, while Eq. (66) gives 93.6% of chance of x being smaller 
than the observed xq; all pdf's were normalized to one. 




Figure 6: Probability density functions for x (blue, dashed line), y (red, dot-dashed 
line) and z (solid line.) The shaded area gives the probability of x being smaller than 
Xq (dashed vertical line.) See the text for more details. 



7 A% 2 test of statistical isotropy 

Although the last example was constructed to emphasize an important feature of the 
formalism developed in ^6j cosmological observables designed to measure deviations 
of either gaussianity or statistical isotropy will often follow asymmetric distributions. 
The intrinsic uncertainties of cosmological measurements, specially the ones originating 
from map cleaning procedures, may be crucial when searching for any map's anomalies. 
We will now make a concrete application of this formalism using the angular-planar 
chi-square test developed in §5.1| 
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7.1 ACDM model 



In the simplest realization of the ACDM model, the covariance matrix of temperature 
maps is determined by (a^ imi a^ 2m2 ) = (— l) m2 C^ 1 5£ 1 ^ 2 5 mi _ m2 . Using this expression in 
(60), we find that 

(67) 

m = 0, then C, 00 = C f . 



Clm (SI) r r 

^ — WO/otf mO • 

On the other hand, if the only non-zero C l e m, s are given by I 



Therefore, statistical isotropy is achieved if and only if the angular-planar power spec- 
trum is of the form (67). Since we are only interested in nontrivial planar modulations, 

(68) 



we will restrict our analysis to the cases where I ^ 0, that is 



C 



Irn 







(Z>2) 



where, we remind the reader, the parity of t comes from the symmetry (24). 

For this particular model, we can also calculate the covariance matrix of the es- 
timator (62) explicitly. Using the null hypothesis above, we find after some algebra 
that 



(69) 



This matrix has some interesting properties. First, note that the planar degrees of free- 
dom are independent in this case (which justifies the (2/ + 1) -1 ) factor introduced in Eq. 
(|6~3~]).) Second, its diagonal elements are given by the variance (<J l e m ) 2 = ((C/ m )*C/ m ), 
which now becomes m-independent: 



i\ 2 



7T- > C ei Ce 2 [I l e f e2 



(70) 



Therefore, for the particular case of the ACDM model, the chi-square test (63) gets 



even simpler. Using (68) and (70), we find 
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(71) 



It is now clear that if the data under analysis are really described by this model, then 
it must be true that 



(ixlYe) 
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where we have used (70). This shows that any large deviation of this test from unity 
will be an indication of planar modulation in temperature maps, up, of course, to error 
bars. For convenience, let us define a new quantity as 



2\l 



xi = (xi) 



(72) 
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which will quantify anisotropics whenever \l is significantly positive or negative. 

This generalized chi-square test furnishes a complete prescription when searching 
for planar modulations of temperature in CMB maps. We emphasize, though, that 
for a given CMB map, the chi-square analysis must be done entirely in terms of that 
map's data. Since we are performing a model-independent test, we are not allowed to 
introduce fiducial biases in the analysis (for example, by calculating <j\ using C™ odel ), 
which would only include our a "priori prejudices about what the map's anisotropics 
should look like. Since the C/s are, by construction, a measure of statistical isotropy. 
Consequently, an "anomalous" detection of Cgs is by no means a measure of statistical 



anisotropy, and it is this particular value that should be used in ( 72 ) if we want to find 
deviations of isotropy, regardless of how high/low it is. 

7.2 Searching for planar signatures in WMAP 



In order to apply the test (72) to the 5-year WMAP data [T4"t [TT] . we will define two 
new variables 

x = (,X 2 )e(th) j V = (x 2 )^(obs) > 
which will be jointly analyzed using the formalism of section £j6j Still, there remains 
the question of how to obtain their pdf 's. These functions can be obtained numerically 
provided that the number of realizations of each variable is large enough, since in 
this case their histograms can be considered as piecewise constant functions which 
approximate the real pdf's. For the case of the (theoretical) variable x defined above 
we have run 2 x 10 4 Monte Carlo simulations of Gaussian and statistically isotropic 
CMB maps using the ACDM best-fit C/s provided by the WMAP team [78J. With 
these maps we have then constructed 2 x 10 4 realizations of the variable x. 

The simulation of the (observational) variable y is more difficult, and depends on the 
way we estimate contamination from residual foregrounds in CMB maps. As is well- 
known, not only instrumental noise, but systematic errors (e.g., in the map-making 
process), the inhomogeneous scanning of the sky (i.e., the exposure function of the 
probe), or unremoved foreground emissions (even after applying a cut-sky mask) could 
corrupt - at distinct levels - the CMB data. 

Foreground contamination, on the other hand, may have several different sources, 
many of which are far beyond our present scopes. However, since different teams apply 
distinct procedures on the raw data in order to produce a final map, we will make the 
hypothesis that maps cleaned by different teams represent - to a good extent - "indepen- 
dent" CMB maps. Therefore, we can estimate the residual foreground contaminations 
by comparing these different foreground-cleaned maps. 

In fact, the WMAP science team has made substantial efforts to improve the data 
products by minimizing the contaminating effects caused by diffuse galactic foregrounds, 
astrophysical point-sources, artifacts from the instruments and measurement process, 
and systematic errors |84" t 185 j . As a result, multi-frequency foreground-cleaned full-sky 
CMB maps were produced, named Internal Linear Combination maps, corresponding 
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Full sky maps 


References 


Hinshaw et. al. 


HUES] 


de Oliveira-Costa et. al. 


m 


Kim et. al. 


m 


Park et. al. 


[82j 


Delabrouille et. al. 


[83j 



Table 1: Full-sky foreground cleaned CMB maps from WMAP data used in our analysis 
to estimate the variable y (see the text for more details.) Note that the reference [81J 
includes the analysis of maps from the three and five years WMAP releases. 



to three and five year WMAP data [14"[ 179 j . To account for the mentioned randomness, 
systematic, and contaminating effects of the CMB data, we will use in our analyses 
several full-sky foreground-cleaned CMB maps, listed in Table [TJ which were produced 
using both the three and five year WMAP data. 

The prescription we adopt to determine the distribution of the observational variable 
y is as follows: we simulate Gaussian random a^ m 's in such a way that their central values 
are given by the five year ILC5 data [79| [1~4"] . and with a variance which is estimated 
from the sample standard deviation of all the maps listed in Table [T] So, for example, 
suppose we have n different full-sky temperature maps at hand and we want to estimate 
the randomness inherent in the determination of, let's say, 032- Therefore, we take 

A/K L 2 Cf >32) -> a 32 , (73) 

with 



j n \ n 

^32 = a 7 VVa 3 2 - a 32 ) 2 and a 32 = - V" a\ 2 , (74) 

\ n — 1 ^— ' n z — ' 

\ 1=1 1=1 

where Af(fi, er) represents a Gaussian distribution with mean \i and standard deviation 
a. Note that if the residual contamination is indeed weak, then the sample variance 
above will be small, and our procedure will reduce to the standard way of calculating 



probabilities. As for the use of a Gaussian in (73), this choice was dictated not only 
by simplicity, but rather by the fact that the propagation of uncertainties in physical 
experiments are usually assumed to follow a normal distribution. Note however that 
there are some instrumental uncertainties, such as beam or gain uncertainties, which 
will not in general follow normal distributions. In fact, some of them may even fail to 
be additive, meaning that our convolution formula will be inapplicable in these cases. 
In our analysis, we have focused only on foreground residuals, where the normality 
hypothesis is reasonable^} 

6 We thank the referee for pointing out this important aspect of the formalism. 
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l\t 


2 


3 


4 


5 


6 


7 


8 


9 


10 


2 


81.1% 


73.7% 


54.1% 


6.1% 


80.7% 


46.4% 


36.9% 


47.5% 


81.8% 


4 


74.0% 


72.6% 


55.0% 


39.6% 


74.2% 


93.2% 


51.6% 


55.4% 


56.3% 


6 


78.1% 


80.7% 


69.3% 


52.3% 


33.6% 


80.0% 


95.0% 


50.3% 


82.2% 


8 


63.5% 


87.7% 


18.8% 


51.5% 


21.4% 


66.4% 


31.6% 


27.5% 


82.3% 


10 


67.9% 


50.0% 


61.0% 


8.7% 


37.7% 


59.5% 


36.6% 


29.2% 


35.7% 



Table 3: Final probabilities of obtaining, in a random ACDM Universe, a chi-square 
value smaller or equal to {x 2 )e( bs)' as gi ven by full-sky temperature maps. 



7.3 Full-sky maps 

Following this procedure, we have used the full-sky maps shown in Table Q to con- 
struct 10 4 Gaussian random d£ m 's, which were then used to calculate 10 4 realizations 
of y = {x 2 ) l £(obs)- With those variables we constructed histograms which, together with 
the histograms for the (full-sky) variable x, were used to calculate the final probability 
(66). We have restricted our analysis to the range of values (£, I) G [2,10], since the 
low multipolar sector (i.e., large angular scales) is where most of the anomalies were 
reported. The resulting histograms and pdf 's are shown in Fig. [7j and the final proba- 
bilities we obtained are show in Table [3] 
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Figure 7: Histograms for (x 2 )f( t h) (blue), (x 2 )£( bs) (purple) and for the difference 
(x 2 ) l £(th) ~ (^ 2 )^(obs) ( s °lid, red line). We show only a few representative figures. The 
final probabilities are shown in Tableland correspond to the area under the solid curve 
from — oo to 0. All pdf's are normalized to 1. 

Overall, our results show no significant planar deviations of anisotropy in WMAP 
data. The most unlikely individual values in Table ^ are in the sectors (l,£) given by 
(2, 5), (10, 5), (4, 7) and (6, 8), and are all above a relative chance of 5% of either being 
too negative [(2,5), (10,5)] or too positive [(4,7), (6,8)]. However, it is perhaps worth 
mentioning that not only the individual values of (x 2 )e are relevant, their coherence 
over a range of angular or planar momenta also carries interesting information. So, for 
example, a set of {x 2 Ye s which are all individually within the cosmic variance bounds, 
but which are all positive (or negative) can be an indication of an excess (or lack) 
of planar modulation. This type of coherent behavior appears in the following cases: 
(x 2 )L (x 2 ) l 3 an d, to a lesser extent, (x 2 )e ~ see Table [3j The angular quadrupole £ = 2, 
as well as the angular octupole I = 3, have all positive planar spectra (for all values 
of I which we were able to compute), indicated by probabilities larger than 50%. The 
planar hexadecupole I = 4 also has 8 out of 9 angular spectra assuming positive values 
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(only i = 5 is negative). 

The data analyzed in this Section relates to the full-sky maps, which are certainly 
still affected by residual galactic foregrounds. The reader interested in the complete 
analysis, including data from masked CMB maps, can check reference |28| . 
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8 Conclusions 



We know our Universe is not perfectly Gaussian, or homogeneous, or isotropic. The 
deviations from an idealized picture (or the lack thereof), whether predictably small or 
surprisingly large, can tell us a great deal about the Universe we live in. Since the types 
of physical mechanisms behind deviations from perfect gaussianity, homogeneity or 
isotropy are typically very different, we should try to measure these individual features 
separately - whenever possible or practical. 

Recently it has been suggested that some of the most discussed anomalies in the 
CMB can be explained away [86J, or that the evidence for them is statistically weak 
[87J. But even if it turns out that our Universe is a plain vanilla kind of place, where 
everything goes according to the inflationary theorist's dreams, we would still need to 
analyze it with tools that allow us to check the standard picture against the data. In 
addition, local physics (related to the solar system, or our galaxy), as well as instru- 
mental quirks, tend to leave imprints on the CMB which are clearly anisotropic, but 
have a certain coherence which can be detected, and possibly corrected for, with the 
help of these checks. 

However, in an era where at least the large-scale maps of the CMB are likely to 
remain basically unchanged, we should be careful not to over analize the data with the 
benefit of an ever greater hindsight (put another way, a posteriori conundrums only get 
worse with time.) This can only be achieved if we find natural and generally agreed-upon 
classifications of the types of deviations that may occur, without too much guidance 
from what the data is telling us. We believe that focusing on the possible underlying 
symmetries, with perhaps some guidance from group-theoretic arguments, is one way 
to settle these issues. We have presented a few methods along these lines, one using 
multipole vectors, the other using a natural generalization of the two-point correlation 
function - and other methods have been presented in this Review. 

Perhaps the best indication that we are on the right track is the fact that most of 
these methods are applicable in other areas of physics and astronomy - and that in 
some cases we have adapted tests of anisotropy from other areas, such as scattering 
theory and the theory of angular momentum in quantum mechanics. So, even if these 
anomalies eventually perish, they will be survived by the powerful methods that have 
been devised to test them. 
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A Geometrical identities and derivations 
Wigner D-functions 

From the unitarity of the rotation operators D(R), we have 



m 



If uj 2 — u 1 1 , then using the identity D e m „ m (uj x ) = D^, 



m"m 



(uj), we find 



m 



Gaunt integral 



The definition of the Gaunt integral used in this paper is 




3-j symbols 



We present here some useful identities related to the 3-j symbols: 



• Isotropic limit 
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Parity and permutations 



h h I 
mi ni2 m 



m mi m2 

W2 mi m 
h h 



1 



1 



\h+h+l 



-mi 



I 



-m2 —in 



Orthogonality 

h h 

E E 

mi=—h rn2=—h 
h+h h 

E E < 2i + !) 

h=\fo-h\ mi=—h 



h h h 

mi m 2 m% 

h h h 

mi m 2 m3 



li I2 / 3 

mi m 2 m' 3 

li I2 I4 

mi m' 2 m' 3 



dl 3 l' 3 dm 3 m' 3 

2/3 + I 



Y(-iy- m ( 1 1 i) = V2f+i5 eo 

^ v 7 \ m -m / 

m=-Z V 7 



The last expression is particularly useful in the derivation of (67) 



Derivation of (60) 



We start by equating expressions (58) and (20) 



^^^E+icf^fcos in y lmill] 



t Lm 



2^ ( a ft m i4m 2 )^i™i (^i)E£ 2m2 (n 2 ) . (75) 

£i,mi £2,i»2 



As mentioned in the main text, the inversion of C\ m as a function of the a^ m 's is not a 
trivial task, since the angles (B, $, 0) depend non-linearly on the angles (81, tpi, 82, ^2)- 
The easiest way to achieve this goal is to pick up a coordinate system where only the 8 
dependence is present. After integrating it out, we rotate our coordinate system using 
three Euler angles to recover back the (0, <3>) dependence, which can then be integrated 
with the help of some Wigner matrices identities. We start by positioning the vectors 
hi and fi2 in the xy plane, i.e, we chose hi = (vr/2, <pi), h 2 = (V/2, </> 2 ). By (59) we then 
have cos$ = cos(0i — <p2)- Using the relation 



Y^ m 0/2,0) = X im e 



irruf) 



.t+rn 
•1 2 



21+1 (l+m-1)!! (l-m-1)!! 
4tt (i+m)\\ (i-m)\\ 



if £ + m e 2N 
otherwise 

(76) 
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we can integrate the 9 dependence on both sides of ( 75 ) . This gives us 

— V"(^ m F/ m (0,0) = V" ^ ( a hm l a£ 2 m 2 )4imie 2 m2 ( 77 ) 
l,m Ki,mi (.2,^2 

where we have introduced the following definition 

C« = -KmMra2 f ^(cosfa - ^e^^tycosfa - <f 2 )) . (78) 



We need now to integrate out the and $ dependence in the right-hand side of 
(77) which was hidden due to our choice of a particular coordinate system. In order 
to do that, we keep the vectors -hi and fi2 fixed and make a rotation of our coordinate 
system using three Euler angles u> = {a,/3,7}. This rotation changes the coefficients 
C l e m, s and a&n's according to 



a £n 



^ D Lm> M<W , C l P - D l mm , (u)C l e m 



where C l e m and a lm are the multipolar coefficients in the new coordinate system and 
where D l mm ,(u) are the elements of the Wigner rotation matrix. The advantage of 
positioning the vectors fix and fi2 in the plane xy is that now the angles and $ are 
given precisely by the Euler angles ft and 7, regardless of the value of a 

l,m l,m' \ m / i,m' 

where in the last step we have used YJ m (0,0) = a/ (21 + l)/4ir 5 m Q. Therefore, in our 
new coordinate system we have (dropping the "~" in our notation) 

l,m t\,m\l2,m2 m^rn^ 

We may now isolate C l e m using the identities |41j 

fduD l \ (uj)D h , (u)D h , (u) = 8n 2 ( h , h , h , ) ( h k h 
where du = sin f3d(3dad'-f, to obtain 

-j===C l e m = 27T £ £ ( ffl <imi fl lm 2 ) £ C'jfemi (_]_)™2+m 2 +m 



A m 2 



lx £2 I \f £1 4 Z 
m' x — m' 2 y V mi — m2 — m 

If we now redefine — m2 — >■ m2 and note that the first 3-j symbol above is identically 
zero unless vn! x = m' 2 , we finally obtain (60). 
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Some properties of the integral (61) 



The geometrical coefficients denned in (61 ) has many interesting properties which 
can be explored in order to speed up numerical computation of (60). First, we note 
that it is symmetric under permutation of £\ and £2. 



ji,t 



£i £ 2 I 
m — m 



m 

Erf f_-\\m+h+h+l 
1 £ 2 m£ 1 m\ l ) 

m 

Ejl ( _y\m+2(e 1 +e 2 +l) ( 

1 e 2 m£ 1 m\ 1 J I 



-m m 



£2 £1 I 
m —m 



Some of the other properties are a consequence of the integral If mt m defined in (78). 
We may note for example that, due to the symmetry of the Xi m coefficient definedin 
(76), we will have: 



T l,t 



0, for any {{£i,£ 2 ) G N \£\ +£ 2 = odd} . 



Furthermore, the A^ m coefficients restrict the m summation above to their values which 
obey: m + £\ + £2 = even. If we further notice that ( 78 ) is proportional to the integral 
of a integral of the form J* P^(cos#) cos rnOdO, and that this integral is zero unless 
£ + m = even, we conclude that 



rid 



0, for any {(£ u £ 2 ,£) E N \£ x + £ 2 + £ = odd} . 



Besides, using the fact that the integral f -^ P^(cos 9) cos m9 d6 is zero for any m < 
we find 

= , for any {(4, £2, £) G N \£i < £, £2 < £}■ 
We finally comment on the special case where / = 0, for which we have 



(-1)* hprl \s 



However 

t' 

/ j 1 l'mi'r 



p.««-*) 1 1 (2f+1)(£ 't m ^ ) " (f : m ^ ) "e'- 



\m=—i' 



in 



(£' + m)\\ (£'-m)\\ 



2f + 1 
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Pe(x)Pi/(x)dx 



2tt 
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where in the derivation above we have made use of the Fourier series expansion of the 
Legendre polynomial. So we conclude that 



l 44 



2ttV2£i + 1 



(79) 



which is needed in the derivation of (67). 
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